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Abstract 

Effective field theories encode the predictions of a quantum field theory at low energy. 
The effective theory has a fairly low ultraviolet cutoff. As a result, loop corrections are 
small, at least if the effective action contains a term which is quadratic in the fields, and 
physical predictions can be read straight from the effective Lagrangean. 

Methods will be discussed how to compute an effective low energy action from a 
given fundamental action, either analytically or numerically, or by a combination of both 
methods. Basically, the idea is to integrate out the high frequency components of fields. 
This requires the choice of a "blockspin" , i.e. the specification of a low frequency field as a 
function of the fundamental fields. These blockspins will be the fields of the effective field 
theory. The blockspin need not be a field of the same type as one of the fundamental fields, 
and it may be composite. Special features of blockspins in nonabelian gauge theories will 
be discussed in some detail. 

In analytical work and in multigrid updating schemes one needs interpolation kernels 
A from coarse to fine grid in addition to the averaging kernels C which determines the 
blockspin. A neural net strategy for finding optimal kernels is presented. 

Numerical methods are applicable to obtain actions of effective theories on lattices of 
finite volume. The special case of a "lattice" with a single site (the constraint effective 
potential) is of particular interest. In a Higgs model, the effective action reduces in this 
case to the free energy, considered as a function of a gauge covariant magnetization. Its 
shape determines the phase structure of the theory. Its loop expansion with and without 
gauge fields can be used to determine finite size corrections to numerical data. 
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1 What are Effective Field Theories? 

We work in the context of Euclidean field theory. 

• An effective field theory encodes the predictions of a quantum field theory at low energies 

• It is a field theory with a low UV- cutoff - of the order of the relevant particle masses. 
For instance it may live on a lattice of lattice spacing oq. 

• Ideally one should be able to read the physics straight from the effective action (= action 
of the effective theory) because loop corrections to tree amplitudes are small at sufficiently 
low UV-cutoff. 
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Problem to be discussed 

Start from a given fundamental action (on the continuum or on a lattice of small lattice spacing 

a <^ ao) 

• How does one define and compute the effective action! 

Sometimes effective actions are guessed on the basis of known properties. 

Example: chiral theories (nonlinear cr-models) as effective theories of mesons. They em- 
body the low energy theorems implied by PCACQ These low energy theorems are sufficiently 
restrictive to fix the effective action at really low energy (i.e. low cutoff) up to a few coupling 
constants. By the definition of an effective theory, the question of the removal of the UV-cutoff 
does not pose itself. Nonrenormalizable a-models are therefore acceptable. 

But ideally, one would like not to guess the effective theory, but to justify it from the given 
fundamental theory. 

Definition of the effective action in terms of the fundamental theory 

The effective action will be a function of low frequency fields $ - for instance, fields $ on a 
lattice of lattice spacing oq. They should be defined as functions of the fundamental fields ip, 

$ = C{ip) , 

If C is linear, we write instead 

^ = Cip . 

C will be called the averaging operator and $ is called blockspin. Later on we will consider 
fundamental theories which are gauge theories, and we will distinguish in notation between 
matter fields (p and gauge fields U. In general the low frequency matter fields will be functions 
not only of the matter fields ip but will also depend on the gauge fields, 

$ = C{U,^) , or $ = C{U)ip . 

The definition of the effective action Tig// in terms of the fundamental Euclidean action Ti.{ip) 
reads 

= JVip 5($ - Ciip)) e-^(^) (1) 
If the effective theory lives on a lattice Aq, then 

5($ - c{^)) = n n ^i^aix) - c^{^){x)) 

when $ has components The 1-dimensional 5-function admits the integral representation 

1 POO 

ZTI J-oo 

This formula, or (better) an approximation of the (5-function by a Gaussian 

= I e'^V^P = ^ r e''^ e-^'"/' (/? small) 
\ p J 27r J-oo 

can also be used when there are Fermi fields. 
^ PCAC = partially conserved axial vector current. 

^Euclidean field theory may be regarded as a classical statistical mechanics. In this language, the action 
becomes the Hamiltonian. Therefore we use the letter H. 



3 



Problems 



• On what number and kinds of fields should Tie// depend ? 

• How to choose the averaging operator C . 

Averaging over space time regions will be involved in getting low frequency fields. 

• How to compute the functional integral ([l|) by analytical or numerical means. 

• How to detect a had choice of C or $. 

The answer to the last question is given by the requirement that the effective action must have 
good locality properties. 

Let us emphasize that Tie// is exact in the sense that it yields exact expectation values for 
observables O which depend on the fundamental fields ip only through <I>q, = Ca{'-p) so that 
O = F{C{ip)). 

(O) = ^jvvF{C{v))e-'^^^^ 

= 1 /■ D$ e-'^efsm _ (2) 

We illustrate the issues involved as well as the dangers of a bad choice of $ or C at an example 
which is well understood. 

Example: Discrete Gaussian model. 

We consider first the case where the fundamental theory is a discrete Gaussian model in 
3 dimensions. It lives on a lattice A of lattice spacing a. This model is the dual transform 
of a f/(l) lattice gauge theory in 3 dimensions. An exact computation of its effective action 
by convergent series expansions was used in IQ to give a rigorous proof that the 3-dimensional 
?7(l)-lattice gauge theory shows linear confinement for all values of the gauge coupling constant 
^,with [3 = An^/g^. 

The points of the fundamental lattice will be denoted by letters z. The fundamental field 
is integer valued , 

n{z) e Z , 

and the fundamental action is 

The difficult case is when 13/ a is large (i.e. weak gauge coupling g.) Note that P/a is dimen- 
sionless in 3 dimensions. 

A choice was made to use a Pauli-Villars cutoff for the effective theory, so that the effective 
theory lives on the original lattice A, to begin with. The cutoff was chosen to be of the order 
of the ultimate physical mass m of the theory, in agreement with the philosophy of effective 
field theory. But it is essential to choose as the low frequency field a real field 

^{z) real . 

The convergent expansions are iterated Mayer expansions. They combine expansions at various length 
scales. Similar expansions can be written down which live on a multigrid |^ - cp. later. 

^We use lattice notations as follows on a d-dimensional lattice A of lattice spacing a: — a'^ J2zeA ^^"^ 
Vp/(z) = [f{z + e^) — f{z)], e^= lattice vector of length a in ^-direction. 
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This illustrates the point that the fields $ of the effective theory need not be fields of the same 
type as those in the fundamental theory . 

One finds that a local sine-Gordon action is an excellent approximation to the effective 
action when (3 is large 

= / [V,$(^)]' + 2z [1 - cos <l>(^)]^ (3) 

with 

z = a"^ exp [— /?fc6(0)] for large /5 , (4) 
vcb{0) = 0.2527 a-^ . 

^cfe(O) is the Coulomb potential at zero distance on the 3-dimensional lattice. (We chose not 
to write the terms in Tie// which implement the Pauli-Villars cutoff.) 

We proceed to a discussion of the physics of the model on the basis of of the effective action. 

1) Mass: The [1 — cos] potential has its minima at $ = 27r-integer. We shall see in a 
moment that tunneling between different minima is very much suppressed, so that the global 
symmetry 

1-^ <l>(x) + 271 N , integer 

is spontaneously broken, and we may restrict attention to one of the minima, e.g. $ = 0, and 
expand around it. Set 

= z 2/5 = 2/3 a'^^exp [-/3wcb(0)] . (5) 
Then the effective action takes the form 

The nonlinear terms [■ ■ ■] are small for large (3 (provided \E' is sufficiently near to the selected 
minimum so that the effective action is sufficiently small to give \E' a nonnegligible probability). 
The effective theory is therefore very nearly a free field theory with a nonvanishing mass m 
which is given by (|^). Numerical simulations confirm this 0. 

The low lying excitations of the model are therefore spin waves with a finite mass which is 
exponentially small in units of the original inverse lattice spacing a~^. This is easy to read off 
the effective action, but is not at all apparent from the fundamental action. 

2) Tunneling: Because of the UV-cutoff we can qualitatively think of the theory as living 
on a lattice Aq of lattice spacing = const. ■ m~^. The effective action becomes 

^e// = E ^ [V,$(X)]^ + 2zo [1 - COS ^x)] (7) 

xgAo "^P 

zq = AqZ = const. ' m~^z (8) 
= const "/?-3/2exp[/5t;cf,(0)/2] (9) 

We see that zq 1— 00 when (3 ^ 00. Therefore the hills of the 2zo [1 — cos $] - potential become 
very high when (3 becomes large. As a result, tunneling becomes very much suppressed. 
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The height and widths of the hills of the potential determine the surface tension of the 
discrete Gaussian model (the free energy per area of an interphase between domains where 
($) = 2nN with different N). This surface tension is equal to the string tension a of the dual 
[/(l)-gauge theory. Treating the tunneling in a semiclassical approximation results in 

a = 8mp-^ . (10) 

Note that 

a/m^ oo as /3 (-^ oo . 

Thus there are two different physical mass scales in this theory. 
3) 2-dimensional discrete Gaussian model: 

Next we turn to the 2-dimensional discrete Gaussian model. This model is the dual trans- 
form of the XY-model (plane rotator) in 2 dimensions with Villain action. 

A rigorous computation of the effective action of this model has not been reported in the 
literature, but the method used for the 3-dimensional model can be expected to carry over for 
large /3. In 2 dimensions, /3 is dimensionless. 

The effective action comes out the same as in 3 dimension, except for the appearance of 
different powers of a (for dimensional reasons) and the following difference in the expression for 
z. Expression (0) results from an approximation which is legitimate in 3 dimensions, but not in 
2. Originally there stands the Yukawa potential f^(0) at zero distance in place of ^06(0), with 
mass fi equal to the Pauli-Villars cutoff a^^. We are interested in the limit where /ia is very 
small. In 3 dimensions, the zero mass limit exists, and we may approximate f^(0) by vcbi^)- 
But in 2 dimensions, the zero mass limit does not exist. Instead, v^{0) increases logarithmically 
with {fia)~^ as the cutoff fi = Oq ^ is lowered. As a result, 

zo = alz = {afi)-^ exp [-(3v^{0)/2] (11) 

decreases when the cutoff is lowered, if (3 is large enough, and the mass in tree approximation 
decreases faster than the cutoff fi. Therefore, the cutoff can be lowered indefinitely. Since zq 
decreases indefinitely, the effective action tends to a massless free field theory in the limit of 
very low cutoff, provided (3 is large enough. This is the Kosterlitz-Thouless phase of the model. 

There is no particular reason to believe that the formula (^ for the effective action re- 
mains accurate for intermediate values of f3. But the nature of the Kosterlitz-Thouless phase 
phase transition could be read off, if one had an accurate expression for the effective action at 
intermediate j3. 

Lessons from the model 

• The physics can readily be read from the effective action, but is not visible in the funda- 
mental action. 

• A good choice of the blockspin $ (real as opposed to integer) is crucial. 

If we had chosen an integer blockspin, the spin waves of zero or small mass m as the 
dominant low energy excitations would not be represented by fields in the action. Moreover, 
one would meet a disaster while calculating the effective action. If one tried to compute it by 
lowering the cutoff step by step (which is what an iterated Mayer expansion does), the action 
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would become nonlocal over distances m ^ while the UV-cutoff /i is still very much higher than 
m. A good choice of the blockspin is 

$(z) ~ average of n{w) over a cube of side ao 

using a smoothened step function for the cube. 

Stages of complexity of the effective action 

To be specific, let us start from the fundamental action of a Higgs model with nonabelian gauge 
fields living on A. A may be either the continuum or a lattice of small lattice spacing. We seek 
an effective theory on a lattice Aq of lattice spacing oq. 

We identify the points x G Aq with hypercubes in the fundamental lattice A. 

These hypercubes are called blocks. Sometimes we will also regard Aq as a sublattice of A, 
so that X G Ao fixes a site in A, e.g. the block center. 

One may envisage different kinds of and approximations to the effective action which require 
different kinds of blockspins 

1. Ignore both the gauge field U and the fermion fields iJj. 

Approximate the fundamental theory by a Ayj^-theory. Choose the block average of the 
scalar field as blockspin |Q 

C{x,z) ipa{z) = Zq (ao) av^(:-,ipa,{z) ■ 

1 /2 

with oc step function. A cutoff dependent factor Zq (ao) (wave function renor- 

malization) has been included with the aim to normalize the kinetic term in the effective 
action in the standard way. 

2. Ignore the gauge field, but consider fermions. 

An effective action which has chiral symmetry requires 2*^/^ flavours f in d dimensions 
(for even d). Otherwise the effective action will inevitably become nonlocal (Nielsen- 
Ninomiya theorem 0). Given fundamental fermion fields with 2^^/^ flavours, we may 
block to staggered fermions on Ao of lattice spacing ^ao. Aq decomposes into elementary 
cells of sidelength ao containing 2'^ sites each. Only translations by integer multiples of 
ao are true translations. Sites within one elementary cell carry different pseudofiavour (a 
combination of spin and flavour). A suitable blockspin is of the form 




where a, / are spinor and flavour indices. averages over a hypercube of sidelength 
oo centered at x, and is only nonzero when (a, /) matches with the pseudofiavour of x. 
Details will be given later on. 

3. Admit dynamical gauge fields. 

The blockspin for the Higgs field will take the form 

$a(x) = / C{U\x,z) ipaiz) (13) 



7 



similarly as in 1., but with an averaging kernel C which depends on the gauge field U for 
reasons of covariance. A formula like (|12D with ^/-dependent kernels Cl will define the 
blocked Fermi fields. In addition one will need to define blocked gauge fields. This will 
be discussed later on. 

4. Composite Higgs. 

There may be no fundamental Higgs field in the fundamental action. In this case one 
may try to define a composite Higgs field $ in the effective action by a formula of the 
following kind 

^c.{x) = f f Y.'^i Cfj{U\x,zw) Xo^r^w) . 

J z J w 

with matrices Aq, in colour space (colour indices of the Fermi fields are omitted). 

5. Fully effective theory. 

In this case, the fields in the effective action are all interpolating fields for physical parti- 
cles. They can be defined as composite fields similar to 4., but they will have to be gauge 
invariant. For instance, the physical electron field could take the form 

^(^)= / / ^[iz)C[iU\x,zw) 

J z Jw 

where ip[ are the left and right handed fundamental electron field and (p is the fundamental 
Higgs field. 

One may wish to compute the ultimate effective action in a sequence of steps, lowering the cutoff 
in stages. In this case, one may have to proceed to different kinds of definitions of blockspins 
at different values of the UV-cutoff corresponding to different length scales of compositeness. 

2 The Constraint Effective Potential 

Starting from a fundamental theory on a finite volume, the constraint effective potential is 
defined as the special case of the effective action in which the lattice Aq consists of a single 
point. In this case, the block spins $ may be vectors in colour or flavour space, but they have 
no space time argument x. 

Consider for instance a fundamental theory which is a Higgs model with or without dynam- 
ical gauge flelds. For simplicity, we do not include fermions. One introduces 

$ = (gauge covariant) magnetization 

by a formula similar as before. The constraint effective potential V^jj($) was introduced in ^ 
and is deflned by 

e-^=^^(*) = Jv^VU 6{<^ - C^) e-^(^'^) (14) 

and similarly in the absence of gauge flelds. C depends on U in the presence of gauge flelds. 
exp {— Ve//($)} gives us the probability density of the magnetization $. But 

Veff ^ volume . 

Therefore the minima of Veff become extremely sharp in the limit of large volume, and in the 
inflnite volume limit, the only possible values which $ may assume are absolute minima of 



8 



Veff. Therefore, the shape of in the large volume limit gives information about the phase 
structure of the theory. One has to look for minima. If there are several absolute minima which 
are transformed into each other by a symmetry, then that symmetry is spontaneously broken. 
For plots which show the crossover of the effective potential from a single shape (symmetric 
phase) to double well behaviour ( broken phase) see Fig. ^ 

If the height of different relative minima changes when one varies a parameter in the theory 
in such a way that the absolute minimum jumps, then the theory has a first order phase 
transition. 

3 Renormalization Group Picture 

In the renormalization group approach one composes the blocking 

A t-> Ao 

n ^ n^jf 

from a sequence of blockings, lowering the cutoff in steps 

A = Ajv ^ Ajv-i I— »• . . . I— *■ Ai 1-^ Ao 
a = t-^ Oiv-i ^ . . . ai t-^ ao 

where aj^i = Laj with L = 2, 3 or so. 

The theory with action W lives on a lattice of lattice spacing aj. Considering one such 
theory at a time, one goes to the unit lattice by setting aj = 1. In this way one obtains a 
sequence of actions W . 

^ ^N-l ^ , , , ^ ^1 ^ ^0 ^ ^^^^ (^5) 

These actions can be compared, since they all live on the same lattice with lattice spacing 1. 
This yields the 

renormalization group flow: W ^ W"^ . 

Imagine that the actions are somehow parametrized. One distinguishes between relevant, 
marginal, and irrelevant parameters. These parameters are also called "running coupling con- 
stants" . 

Relevant: A small change in a relevant coupling constant in Ti^ changes very much for 
large n. 

Irrelevant: A small change in an irrelevant coupling constant changes TY-'^" arbitrarily little 
for large n. 

Marginal: in between. 

^Ve///volume becomes convex in the infinite volume limit because the height of maxima of Veff between 
minima increases less fast than the volume. 
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Figure 1: Multigrid Monte Carlo data for Ay? ^-theory with error bars as a function of 
the magnetization $ for bare couphng = 16.376 and fit with the one loop formula for the 
constraint effective potential for three values of the bare mass a) = —1.14 , b) = —1.15 



c m: 



-1.16. 
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In the limit of small a, many renormalization group steps will be needed. As a result, knowledge 
of all relevant and marginal coupling constants in Ti suffices to determine Ti-eff- Theories with 
actions Ti whose relevant and marginal parameters all agree are said to belong to the same 
universality class. 

Let us remark that one can also lower an UV-cutoff (Pauli-Villars, for instance) infinites- 
imally in one step. In this case the renormalization group flow is determined by a functional 
differential equation known as the renormalization group differential equation |]^. 

In the conventional approach, all the actions W depend on the same type of fields. But 
we noted before that more fiexibility should be left. It may be necessary to change the kind of 
fields at some length scales (of compositeness) Oj. In particular, W may depend on different 
kinds of fields than the fundamental action Ti when j < N . 

Effective observables 

In principle, expectation values (O)?^ of arbitrary observables in the fundamental theory may 
be translated into expectation values {Oeff)'Heff effective theory, such that 

{0)n = {Oeff)'H,ff ■ 

But this requires that one computes effective observables in addition to the effective Hamiltonian 
0, 

Oeff{<l>) = e^^«(*) J Vif 0{if) 6{<^ - C{^)) e-^(^) . (16) 

4 Computation of Effective Actions by Perturbation The- 
ory (Loop Expansion) 

for linear averaging maps C in theories with or without gauge fields. 

Scalar free field theory 

^frcc(^) = ^J^^{z)[-A + m^]y^{z) = ^{^, l-A + m^]^) . 

Herein, A may be either the ordinary Laplacian, or the covariant Laplacian in some given 
external gauge field, and the original theory may live either on the continuum or on a lattice 
A. Its sites are denoted by z,w. Our aim is to compute an effective action on a lattice Aq of 
lattice spacing oq < 0(m~^). Sites in Aq are denoted by x,y. 
We assume a linear averaging map C, 

$(a;) = / C{x,z)ip{z) = Cip{x) . 

J z 

In the absence of a gauge field, we may choose 

otherwise. 

The original lattice A of sites z is covered by a block lattice Aq, whose sites x are blocks with 
side length ao, so we write z E x if the point z is in the block x. More generally, C may depend 
on an (external) gauge field. 
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Following Kupiainen and Gawedzki 0, T^, one splits the field (p into a low frequency part ip, 
called the "background field" , and a high frequency part C, called the "fiuctuation field" . The 
low frequency field is supposed to be obtainable from the block spin $ on Aq by interpolation 
with an interpolation kernel A. 

= A^{z)+C{z), with A^{z)= I A{z,x)^{x) . (18) 

If we demand that 

CA=1, (19) 
then it follows that the fiuctuation field has zero block average, 

= . (20) 

Now we insert the field split into Tifree- Let us consider the case m = 0. Generalization is 
obvious. 

((^, A<^) = (^<l>, A^<l>) + (C, AC) + 2(C, A^<l>) . (21) 
We demand that A is chosen in such a way that the mixed term vanishes, 

(C, A^$) = . (22) 

This gives 

^free=^($,-Ae//$) + ^(C,AC), (23) 

with 

= A*AA . (24) 



Since = it follows that the mixed term (p2D vanishes as desired, provided 

- A^ = C*u-^ (25) 

for some u^^. 

Applying A* to both sides of ( p5D it follows that 

= -Ae// . (26) 

The operator A^ff acts according to 

A,fffix)=[ A,ffix,y)fiy) (27) 

JyeAo 

with kernel 

A,ff{y,x)= J^Aiz,yyAAiz,x). (28) 

Here, A acts on the argument z, and ^ denotes the hermitean conjugate of a matrix, or the 
transpose, if the matrix is real. 
TsD becomes 

AA{z, x)= I C{y, z)^A,ff{y, x) . (29) 



12 



If C{x,z) is the step function (|T^) then (p9|) says that AA{z,x) should be constant on blocks 
y as a function of z. The kernels A and ^eff are determined as solutions of (p9[) together with 
the subsidiary condition (^). 

Remark: The field iplz) = A^{z) minimizes 7ifree(V^) subject to the subsidiary condition 
that ijj has the prescribed block average $, viz. 

C-ipix) = $(2;) . 

This remark can be used to compute the kernel A by standard optimization algorithms. 

Numerical results confirm that the kernel A{z, x) decays exponentially with the distance 
of z from the center of the block x, with decay length oq. This is true with and without an 
external gauge field in A, see for instance Fig. ^ of Sect. ||. As a result, Ae//(x,?/) decays 
exponentially as a function of \x — y\ with decay length one lattice spacing oq. 

Now we are ready to compute the effective action. By definition 

g-He//(*) = J _ C(^)e-^f-('^) . (30) 

We shift the field ip by an amount depending on $, using ( = (f ~ A^ as a new variable of 
integration. Since CA = 11 by assumption, it follows that 

= ei(*,A.//*) J s{CO e-^f-^(« . (31) 

The C-iiitegral merely produces a constant independent of $. Therefore 

= ^($, -Ae//$) + const. . (32) 

Heff has good locality properties because of the aforementioned decay properties of Ae//(x, y). 
There are exponential tails, but this is tolerable and in general inevitable. 

Split of the propagator 

The split of the field induces a split of the free propagator v = (— A)~^ into a low frequency 
propagator viow and a high frequency propagator (fluctuation field propagator) F. For proper 
choice of the normalization factor A/", 

d/xr(C) = Af-'V( e-(^'-^^)/2 ^^(jQ (33) 
is a normalized Gaussian measure with covariance 

T{z,w)=Af-^ J VC 6{CC) e-(^'-^^)/2 ^^^^ ^^^^ _ 

Thinking of the 5-function as a limit of a Gaussian, one sees that 

F = lim (-A + kC*C)-^ (34) 

Since ( is supposed to be the high frequency part of the field if, it is a field with an infrared 
cutoff (equal to the UV-cutoff of the effective theory). Therefore one expects that F(z, w) decays 
exponentially with the distance of z from w, with decay length qq. Analjd;ical and numerical 
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results confirm that this is indeed the case. This remains true when there is a a gauge field in 
A, provided the kernel C is chosen in the right way. A proper choice will be introduced later. 
Let us now compute the 2-point function. 

{^{z)^{w)) oc Jv^V^ 5($ - dp) e-^f-^('^) ^{z) ip{w) (35) 
= Jv^VC e(*'-^^//*)+(^'-^f) 6{C0{A^z) + C{z)){A^w) + ({w)) (36) 

Since u = (— Ae//)~^ it follows that 

for a proper choice of the normalization factor A/$. 

As a result we get the following final result for {ip{z)ip{w)) = v{z,w) 

V{Z, W) = Vionj{z, w) + T{z, w) , (37) 

with 

vionj =AuA* , u = . (38) 

More explicitly, (|37| ) reads 



T{z,w) = v{z,w) — / / A{z,x)u{x,y)A\w,y) (39) 
V = -A-^ and u = -A;/^ (40) 

Perturbation theory 

We will now show how to compute effective actions by perturbation theory. To be specific, let 
us consider an action for a scalar field (p of the form 

= j{-\^{z)A^{z) + V{^{Z))} = {n,ree + V){^) . (41) 

with 

for instance. Introduce 

K,(C)= / V{A^{z) + az)) . (43) 

J z 

This determines a self interaction of the high frequency field C, which depends parametrically 
on the block spin $. 

Proceeding as before one finds the following expression for the effective action 

^e//($) = ^eff<^) + K//($) (44) 

with 

= j e-F.(c) . (45) 
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We remember that cZ/ip is the normahzed free field measure with propagator T. The logarithm 
of the C-integral can be computed by standard perturbation theory. It is given by a sum of Feyn- 
man diagrams, whose lines represent propagators F, and whose vertices depend parametrically 
on $. 

The resulting effective action Tie// will have good locality properties, provided the fluctu- 
ation fleld propagator r{z,w) and the interpolation kernel A{x,z) decay exponentially with 
decay length oq. 

Perturbation expansions of the kind described here were flrst done by Kupiainen and 
Gawedzki as part of a rigorous renormalization group analysis. Constraint effective 



potentials were also computed by another method in [11 



Constraint effective potential 

We specialize the results of the previous subsection to the case of the constraint effective 
potential. In this case, Aq has only one point x. Therefore the kernel of Ae// can only have 
fx, as its only pair of variables. We admit the possibility that A is the covariant Laplacian 
in an external gauge fleld. Of course, Ae// will also depend on U in this case. We abbreviate 

Aeff{x,x) = Co • 

Eq. (^) simplifles to 

K//($) = K//($) = $ eo $ + Kffi^) . (46) 

with Veff from (^). 

In the presence of an external gauge fleld U, eo depends on U. Vejf will also depend on U, 
because the high frequency propagator F does, and the kernel A as well. In addition there is a 
gauge fleld dependence in the normalization factors of the Gaussian measures. This leads to a 
$- independent, but ^/-dependent term in Veff which will be ignored for now, but which has to 
be taken into account later on when we consider dynamical gauge flelds. 

Without gauge fleld, eo = 0. But in the presence of a gauge fleld which is not a pure gauge, 
—A is strictly positive, and therefore eg is also strictly positive. 

Let us emphasize that Tie// has no infrared problems because F has a built-in infrared 
cutoff. 



1-loop approximation 

In 1-loop approximation, the contribution Veff to the constraint effective potential (^6|) becomes 
K//($) = / / ViA^z))T{z, w)V{A^w)) + ^Tr ln(-A + V'iA^)) (47) 

J J z,w Z 

where a prime (') indicates a derivative with respect to the argument, and V" is the multipli- 
cation operator deflned by 

iV"{A^)f)iz)=V"iA^z))fiz) . (48) 

In graphical notation 

Tr ln(-A + V"(^$)) = J2 ^ f ■ ■ ■ f 



n>l 
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with Feynman rules 



w 



T(z,w) 



The first term in expression ( ^Tj) is not 1-particle irreducible. This term vanishes in the absence 
of gauge fields, and more generally it vanishes up to second order in the gauge coupling constant. 

The contribution Veff to the full effective action on a lattice Aq of more than one point 
has the same form. The 1-particle reducible contribution is important in this case [lOf . 



Use of the 1-loop formula to compute finite size corrections to the 
constraint effective potential 

For definiteness, let us consider Ay^^-theory. 

From the constraint effective potential/ volume in the infinite volume limit one can extract 
renormalized coupling constants, provided one knows the wave function renormalization con- 
stant from an independent calculation. 

In fact, according to the standard renormalization of the Ay^^-theory, one needs to define the 
three quantities mr,Xr and ( renormalized mass, self-coupling constant and wave function 
renormalization constant respectively). This can be achieved in the symmetric phase through 
the second and the fourth derivatives of the constraint effective potential at its minimum, viz. 

^ (49) 



and for the self-coupling constant 

A. = I^IL^. (50) 



In the broken phase it is useful to define the renormalized square mass also through (|49|) 
but the self-coupling constant A^ is now defined through the tree level relation: 

Xr = ^ml/vl (51) 

where Vr is the vacuum expectation value of the field = Z~^^'^Lp and Uejf = ^///volume. 

One can obtain a renormalized loop expansion for the constraint effective potential by ma- 
nipulating the bare expansion. Eqs. (^) - ( ^I]) determine the renormalized coupling constants 
as functions of the bare parameters. Inverting this relations, one obtains the bare quantities as 



a function of the renormalized ones. One may use them to express V in expression (^71) ( or its 
corresponding n-loop generalization ) for the constraint effective potential as a function of the 
renormalized coupling constants. Reordering the resulting expansion in number of loops gives 
the desired expansion in terms of Xr,mr. 

In practice, numerical computation of constraint effective potentials is only feasible on finite 
and not too large lattices. 



The following strategy, which was first proposed by one of us (G.P.) in [0], eliminates the 



need to use very large lattices. It makes it possible to extract renormalized coupling constants 
from numerical data for values of the bare coupling constants which are very close to the critical 
line, and for lattice sizes which need not be large compared to the correlation length. 
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Table 1: Renormalized parameters from the fits of Fig. ^ and the corresponding values obtained 
from Liischer and Weisz (^^). 





mr (oo) 


Xr (oo) 


mi^^ 




ml = -1.14 


0.079 ±0.003 


7.5 ±0.6 


0.08 


7.4 ±0.2 


ml = —1.15 


0.06 ±0.005 


7.1 ±0.1 


0.06 


7.2 ±0.2 


ml = —1.16 


0.13 ±0.008 


7.8 ±0.1 


0.13 


7.6 ±0.2 



Triviality of A<y9^-theory implies that the renormalized coupling constant A^. is necessarily 
small when one is close to a critical point, provided the UV-cutoff is high enough, i.e. provided 
a -C correlation length. 

Therefore, a perturbative calculation of the constraint effective potential on a finite lattice 
as a function of the renormalized coupling constants will yield accurate results. In the practical 
application |12, 13] it was verified that the two loop corrections are indeed very small. 

The result of this computation is fitted to numerical data taken at various values of the 
bare coupling constants to extract the renormalized coupling constants as functions of the bare 
ones. 

Some results of this kind are shown below. Numerical results of Meyer and Mack [|15] are 
shown in Fig. ^ together with a fit by a renormalized 1-loop formula for the constraint effective 
potential. Numerical results for the constraint effective potential for 4-component <^^-theory at 
stronger coupling but further away from the critical line were presented in |TE . 

The results are compared with analytical results of Liischer and Weisz [|17| in Table ^ . 

Perfect agreement is found. In this way, the nonperturbative part of the analytical compu- 
tation of Liischer and Weisz is sucessfuUy tested. 



5 Averaging and Interpolation Kernels C and A in the 
Presence of Gauge Fields, for Bosons 

In the presence of a gauge field U, the definition of a blockspin $ for a (n-component) scalar 
field if and the split of this field into low frequency and high frequency part read 

$ = C{U)^, 

C{U)A{U) = 1 . (52) 
The operators C and A have kernels 

A{U)(!?{z) = f A{U\z,x)^x), (53) 
CiUMz) = I C{U\x,z) ^{z) . (54) 

As usual we denote points of the block lattice A° by x, y and points of the fundamental lattice 
A by w. 
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In the absence of gauge fields, a good choice of C is given by 



C*[z,x) = C{x,zy 



lag ii z e X 
otherwise 



denotes the hermitean conjugate of a matrix.) This satisfies 

A,, r*(^ r]-f ^o(^) C*{z,x) ifzex , , 

-/\m,.C j Q otherwise . ^^^^ 

An,x is the lattice version of the Laplacian with Neumann boundary conditions on the boundary 
of the block x, and eo{x) = is its lowest eigenvalue. In the presence of a gauge field, the 
Laplacian with Neumann boundary conditions takes the form 

^N,Jiz) = a-' J2 Pi^, ^)fH - fi^)] ■ (56) 

to nn.z,w£x 

U {z, w) is the lattice gauge field attached to the link from w to z, the summation runs over 
those nearest neighbours w of z which are inside the block x. 

Proposal: [|18[ Retain the eigenvalue equation ( p5D for C in the presence of gauge fields. 



with covariant Laplacian Atv,^ as defined in (^), and eo{x) equal to its lowest eigenvalue. Of 
course, both C and eo(x) will depend on the gauge field. 

The interpolation kernel A may also be defined by the same equation as before. There 
should be u{y, x) such that 

- AAiz,x) = I C*iz,y) uiy,x) . (57) 

Here A is the gauge covariant Laplacian on the whole lattice, m is a Lagrange multiplier which 
is determined by the subsidiary condition (^). 

In the special case when the block lattice Aq consists of a single point x, the Neumann 
boundary conditions are inoperative, because the block x has no boundary when periodic 
boundary conditions for A are in force. Therefore Atv,^ = A. The equation for A is therefore 
satisfied by 

A = C* if Aq = single site . (58) 

Let us next discuss the arbitrariness in the solution of (^5]). Consider for example the gauge 
group G = SU{2). For fixed x, the 2x2 matrix C{z,x) may be thought to be composed of 
two 2- vectors, each of which is a solution of the eigenvalue equation (|55D . (It can be proven 
that there exist two linearly independent solutions with the same eigenvalue.) Arbitrary linear 
combinations of these two solutions will also be solutions. This reflects itself in the arbitrariness 

C*{z,x)^ C*{z,x) n{x) (59) 

of the 2 X 2-matrix solution C of (0), where VL{x) are arbitrary 2 x 2-matrices. To eliminate 
this arbitrariness, we may impose subsidiary conditions 

1. Normahzation: CC* = 1. 

This restricts the arbitrariness to unitary matrices Q{x). 
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2. Gauge transformation property: 

Select a distinguished site x in the block x - for instance the center of the block - and 
demand that C*{x, x) > as a matrix. This makes the kernel C unique. It will transform 
under a gauge transformation | according to 

C*{z,x)^giz)C*{z,x)gix)-' . (60) 



It follows that the blockspin $(x) transforms like a field sitting at the block center x, 

$(a;) 1-^ g{x)^{x) . (61) 

Given C, the kernel A is also unique and it transforms under gauge transformations exactly 
hke C*. 

There exists an efficient algorithm for computing the gauge field dependent kernels C and 



A Pf 



Let us consider the implications for the effective potential in the presence of dynamical gauge 
fields, (01) • Since the U-dependent kernel C will depend on the choice of x, the gauge covariant 
magnetization will also depend on x. But it is readily verified that the effective potential ([T^) 



is independent of the choice of x |14 



6 Effective Potential for a Higgs Model with Dynamical 
Gauge Fields 

Let us consider a Higgs model with dynamical gauge fields but without fermions, with a gauge 
group SU{2) and a 2-component scalar field ip. Let 

$ = gauge covariant magnetization = / C{z)ip{z) , (62) 

J z 

defined with the help of the gauge covariant kernel C (x, z) which was described in the last 
section. The argument x is redundant because there will be only one point x in Aq. Of course, 
C and $ will depend on the gauge field f/, and so does A = C* . By definition, 

g-v;//{4-) ^ J-pu ^-v{u,'S>) 
-nu,^) ^ [ -p^ _ cy,) e-«(^'^) . (63) 



7i is the standard action for a Higgs model. Using an appropriate lattice definition of the field 
strength tensor Ff^,y |]19[, the Wilson action for the lattice gauge field can be written in standard 



Ff,^F^i,-ioYm, and 



n 



\F,,F;, + ^{y,,-A^)+V{y^{z)) 



V{^{z)) = lmV(^) + ^AV(^) • (64) 

A is the gauge covariant Laplacian. It depends on U. 

® Remember that under a gauge transformation (p{z) i— > g{z)(p{z) and U{z,w) i— > g(z)U{z,w)g{w)~^ . The 
covariant Laplacian Ajy^x is gauge covariant. Therefore the eigenvalue equation (p3) is also gauge covariant. 
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see 



The functional integral will be performed in two steps. (For details of the loop expansion, 
14|.) Calculations by a different method were reported in [^] for a t/(l)-gauge field.) 



if integration 

Dip (5($ — ^^C{z)ip{z)) is an unnormalized Gaussian measure with nonzero mean. First we 
shift the field to bring the mean to zero. We introduce a new integration variable C, in place of 

ip{z) = ri{z) + Ciz) with r]{z) = A{z) $ (65) 

Since Aq is a single point, the choice ^ = C* is in agreement with previous definitions. The 
interpolation kernel satisfies therefore the eigenvalue equation 

- AA{z) = eoAiz) , (66) 

eo = lowest eigenvalue, A = covariant Laplacian on the lattice with periodic boundary condi- 
tions. 

Using A* A = CA = 1, the kinetic term becomes 

-A^) = $60$ + (C, -AC) (67) 

so that 

e-nu,'^) = e-\^^^iu)^ jvC5{jc{z)C{z)) exp (-i(C, -AC) + V(r^(^) + C(^)) ■ (68) 



The eigenvalue problem (|6g) can be solved by time independent perturbation theory which is 
familiar from elementary quantum mechanics. Set 

U{z + e^,z) = l-gA^{z) + ... . (69) 

To order g"^ one finds 

eo = ^ / I^.WP + ^E / / A,{z)T{z,w)A^{w) . (70) 

The high frequency propagator F is only needed to zeroth order in g and the expression valid 
in a scalar theory without gauge fields can therefore be used for it at this point. 
In conclusion 

/ {^^M.^;. + = J \f,.f;^ + \^'m j\A,? + .... (71) 

with 

m\^) = ^g'm' . (72) 

We see that a ^-dependent mass term for the gauge field has appeared. 

In the full theory, the fluctuations of the random variable $ tend to zero in the infinite 
volume limit (cp. J. Klauders lectures at this school [^), and its value becomes fixed to the 
position $ of an absolute minimum of V^j/($). As a result 

m^($) = W-meson mass . (73) 
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The integral 

J VCexp (-i(C, -AC) + / Viviz) + C(^))) (74) 

is computed by expanding the exponent in the gauge couphng constant g and using the standard 
formula for a Gaussian integral 



/ 2^Cexp|-E{^(C|M,C) + (Cl^.)}} = (detM)V2exp (^^ • (75) 

Expressing it in terms of the normalized Gaussian measure dur with covariance (= free 
propagator) F, the integral ( fTlD equals 

e-^T"^^ ^-/^v(,(.)+cW) . (76) 

The high frequency propagator F depends on the gauge field, and so does A = C* and therefore 
rj, defined in (^). F was defined in (|^) and 

Trc = trace on the space of functions / which obey Cf = 0. 

With some effort, Tr^ InF can be evaluated to second order in g. It is a quadratic expression 
in the gauge fields which will add to the kinetic term of the gauge field. 



Gauge field integration 



Expanding everything to second order in the gauge coupling constant g, the integration over 
the gauge fields becomes a Gaussian integration which may be performed. Thanks to term 
$eo$ in in (p7| ) and also additional quadratic terms in the vector potential which resulted 
from the ip integration, the kinetic term for the gauge field is nondegenerate. 

In fact, it turns out that the whole quadratic contribution in the gauge field in the large 
volume limit is equal to 



S'M] = ^T.i:^rs |5,.p + m2($)] - (^1 + ^^(<|.)0(P)) k,K^ \A,{k)\' (77) 

where Af^{k) is the Fourier transform of the gauge field ^^(-z) and 0(A;^) is equal to one for 
k'^ and the interpolation to continuous fc^ vanishes faster than fc^ for /c^ — > 0. 

The quadratic operator of (^) does have an inverse (the propagator) and therefore no gauge 
fixing is necessary! 

The final result in finite volume is given in (|120|) in the Appendix. 



Comparison with known results ||2^ for the conventional effective potential reveals that 
( |120D simplifies in the infinite volume limit to 

K//($)/ volume = conventional effective potential in the Landau gauge. (78) 

The conventional effective potential is the Legendre transform of the generating function 
for Greens functions, with a constant field as its argument. It has been computed before 
on a finite lattice in the Landau gauge. 
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Figure 2: The Constraint Effective Potential in the presence of gauge fields in the infinite and 
large cutoff limit. The gauge coupling was fixed to be = 0.4, the renormalized mass 
is given by = — | % and the scalar self coupling constant assumes values on both sides of 
the critical line (solid line). We observe that the minimum jumps from zero to $min when the 
parameters are varied from the symmetric to the Higgs phase. This means that the system has 
a weak first order transition. 



If one performs a formal continuum limit in addition to the infinite volume limit in order to 
replace sums over k by integrals, the expression ( |120| ) simplifies (after inserting counterterms 
into the action) to the following expression for the one- loop constraint effective potential t/g// = 
Ve/// volume 



[/e//(<f)= t/c/($) + 



647r^ 



with 



+3 (*($))' [In (^) 



(79) 



9/ity($) 



ln(^^($)) 



and 



m2($) = i(72$2_ 



(80) 



The expression (|7y) is manifest gauge invariant and it agrees with the known result of 



Anna Hasenfratz and Peter Hasenfratz for the conventional effective potential computed 
in the Landau gauge. $ is the unrenormalized covariant magnetization, i.e. no wave function 
renormalization factor is included in its definition. 

Small renormalized couplings = 0{g^) are within the domain of validity of the pertur- 
bative result of (^). Its shape is shown in Fig. |^. Inspection of Fig. ^ shows that the absolute 
minimum jumps when the parameters are varied. Therefore there is a first order phase tran- 
sition at weak couplings. This was noted before by Hasenfratz and Hasenfratz |2^, following 
the pioneering work of Coleman and Weinberg ll2T 
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7 Blockspin for Fermions 



As already discussed in T. DeGrand's lecture p5|, the naive discretization of the continuum 
fermion action results in a lattice model with unphysical low energy modes, called "doublers" . 
To reduce the number of flavours, i.e. the degeneracy of the lattice Dirac operator, one can 
use one-component "staggered" fields which live on different lattice sites. In d dimensions, a 
staggered fermion field describes physical fermions with 21'^/^] flavours. 

We shall first be concerned with blockspins of the form ([T^), i.e. we ignore gauge fields for 
now and consider only fermions. 



Free Staggered Fermions on the Lattice 

For simplicity let us first look at free staggered fermions on a dimensional lattice Ka/2 = 
Q^oKY of lattice spacing a/2. Only translations by integer multiples na of a are regarded as 
true lattice translations, i.e. survivors of translations in the continuum. These true translations 
map the sublattice = {a^Y of lattice spacing a into itself. 

We imagine that the lattice Aa/2 is divided into hypercubes of sidelength a containing 2"^ 
sites each. Sites within one hypercube are distinguished by their "pseudoflavour" H. H is 
specified by a set of distinct indices /ij 

H = {fiifi2- ■ ■fJ'h,0 < h < d,fii < fi2 < . . . < fih} ■ (81) 
Sites Xh in Aa/2 with the same pseudoflavour H are of the form 

Xh = x + ^en, e// = ^ e^, x e Aa 

where is the lattice vector of length a in /i-direction in A^. The sites xh form a sublattice 
Af of Aa/2 (see Fig. D, viz. Af = A, + ^en- 

Pseudoflavour is a combination of spin and flavour in the sense that there is an expansion 

j:x{xH)dx^ = Y.^J{x)Zj . (82) 

which exhibits the field x{xh) at site in a hypercube as a linear combination of field 
components with spinor index a and flavour index f (Z = {Zj) is a matrix of differential 
forms). Each index takes values 1, . . . , 2['^/^l. For details, see [p6[| . 



Blocking Consistent with the Symmetry 

We distinguish between two kinds of blockings: from the continuum to some lattice, and from 
one lattice to a coarser lattice. We wish to block in such a way that as much is preserved of 
internal and space time symmetries as is possible. 



Blocking from the Continuum to the Lattice. 

Given Fermi fields '^^{z) on the continuum with 2^'^^'^^ flavours /, let x^i^) be the linear 
combination of field components of given pseudoflavour. The staggered blockspin on the 
lattice Aa/2 has the form 

E{x) = f C^(x, z) x'^iz) if X e Af C Aa/2 . (83) 
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Figure 3: Part of a pseudoflavour lattice A-^ for d = 2, H E {0, {1}, {2}, {1, 2}}, embedded 
into A = Z X Z. 

Here is an integral over continuous space. 

In the absence of gauge fields the kernel of the averaging operator is given by 

{-) C j Q otherwise . ^^^^ 

i.e. one averages over a cube of sidelength a with center x. Note that the cubes around block 
lattice sites x,y may overlap if x,y have different pseudoflavour (see below). It can be verified 
that this choice of blockspin leads to good locality properties of the effective free action on the 
lattice [|7|. 

We proceed with considerations on the staggered fermion symmetry group. 
Residual Symmetry on the Lattice. 

For bosons, any flavour symmetry G in the continuum is preserved on the lattice; continuous 
space time translations get broken to lattice translations and similarly for Euclidean rotations. 



For staggered fermions, the situation is different [^: blocking as described above from 
the continuum, the total symmetry Q = G x T (e.g. flavor group G = SU{2) for d = 4, 
translation group T) gets broken to a subgroup Qa on a lattice of lattice spacing a/2, is 
generated by translations by integer multiples of a and by certain combinations ed^ of flavour 
transformations {e = ±1), and translations by eH/2. The action Ta of the lattice symmetry 
group on a staggered field on the lattice Aa/2 reads as follows 

{Ued^)x)iy) = e ph,k xiv + \eK) if 2/ e Af . (85) 
The symbols Ph,k denote sign factors as in the Clifford product of forms V, defined by 
dx" V dx^ = dx^" A dx" + 6^"" and dx^ V dx^ = pH,Kdx"^^ , 
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with the symmetric difference HAK = {HUK)\{Hr\K) of sets H, K of pseudofiavour indices. 

This agrees with the known symmetry group of standard staggered fermion actions ||29[ . 
Note that translations by ch are true translations. 



Fine-to-Coarse Blocking. 

The fine-to-coarse blocking step maps fields which live on the fine lattice Ka/2 onto fields living 
on the coarser lattice of lattice spacing Lba/2, with an integer scale factor Lb > 1. It is 

natural to demand that the symmetry group Q^^a on the coarse lattice should be a subgroup 
of the symmetry group Qa on the finer lattice 

QL,a C Qa . (86) 

In addition to the true translations by e'^ = L^eu there will be combinations [| 

of flavor transformations and translations by \e'fj. 

The symmetry operations d'jj must be a combination of some dx and a true translation e. 
From (^), it can be seen that d'j^ contains an admixture of translation by — Since dx 
contains an admixture of a translation by \eK, we must have 

-\e'H = -l-ex + e with e = ^n^e^ . (87) 

Since e'fj = LhCn, the only way how this can happen is that H = K and e = |(Lb — l)eH = 
true translation. This requires that Lf, is odd, hence Lf, = 3, 5, . . . but not = 2 \ 

This result might have been expected, since only for odd Lf, the pseudofiavour (and parity) 
on the fine lattice Aa/2 and on the block lattice match nicely when we regard AL^a/2 as 

sublattice of Aa/2 (see Fig. § below). 

For a true symmetry it should not matter in which order we apply the blocking step and the 
symmetry operations: the blocking map C should commute with the action T of the symmetry 
under GL.a C Ga, viz. 

TLaid'n) C{x, z) = C{x, z) Taidn) T,(i (L, - 1) en) ■ (88) 
This is fulfilled if C is translation invariant so that 



C{x + e'H,z + LbeH) = C{x,z) (89) 

and if in addition C has the support property 

C{x, z) = unless x, z carry the same pseudofiavour. (90) 

In words, the support property states the following. If ?/ G A^^^^^ then the (staggered) blockspin 
S(|/) is a weighted sum of fields x{^) at sites with the same pseudofiavour H. Blockings with 



such a property were considered for 2-dimensional U{1) gauge fields by Ben Av et al. ||30 
However, they chose a scale factor Lb = 2. 



""A prime (/) refers to the coarser lattice. 
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Generalization to Nontrivial Gauge Fields 



We now admit dynamical gauge fields in our staggered fermion theory; the averaging kernel C 
will then have to depend on the gauge field for reasons of covariance as in (|T3p. Our interest 
will gradually shift towards a practical implementation of the renormalization group picture 
outlined in Sect. ^, known as the multigrid approach to numerical computations in lattice field 
theories Q. 

To specify a blocking prescription, we retain blocks as introduced in Sect. |^: the solution 
of the gauge covariant eigenvalue equation 

- /\b.cC*{z,x) = \o{x)C*{z,x) (91) 

yields a gauge covariant kernel C. Results reported below show that ( |9lD produces a good 
blockspin when Af,.c. is the Laplace operator with appropriate boundary conditions (basically 
Neumann b.c). The covariant Laplace operator connects only lattice sites of the same pseud- 
ofiavour; therefore (|91|) is consistent with the condition (|90|) above. 

In a multigrid approach, one needs interpolating kernels A from coarse to fine lattices in 
addition to the averaging kernels C. The kernels A should be smooth. In the present context, 
there are a priori two possible definitions of smoothness. 



1. "Laplacian" smoothness. 

X possesses Laplacian smoothness if 

(V^X,V^X) = (X,-Ax) 

is small, with the covariant Laplacian A: for our model of staggered fermions this is expected 
to work if the field strength tensor F^^,, conveniently defined on the lattice by 

F^,{z) = U{z + ^e^, u) U{z, /i) - U{z + fi) U{z, u) . (92) 

is small. This is the standard definition of smoothness. 

The problem is that there are no smooth functions in this sense if the gauge fields are 
disordered, because A is strictly positive in this case. 



2. "Diracian" smoothness. 

X possesses Diracian smoothness if 

is small. Since 

-f^ = -A + a^,F^, , (93) 

sites of different pseudoflavour are connected. 

When one wants to compute propagators of fermions by use of the multigrid approach, 
then one should in principle use interpolating kernels A which possess Diracian smoothness. 
Unfortunately this is impractical for reasons of storage space. Laplacian smooth kernels A are 
much more practical. Possible compromises are discussed in . 

In Sect. ^ the problem of smoothness of the interpolation kernel A (see Sect. ^ will be 
readdressed using optimization wisdom borrowed from neural computation. 

^ Those readers who are not famihar with multigrid methods may first want to read Sect. H; a more detailed 
mathematical account can be found in pil . 
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Figure 4: The good choice of blocks to average over. C{z,x) is only nonvanishing if site z 
has the same pseudoflavour (symbol) as the block-center x (encircled symbol). Therefore the 
seemingly overlapping blocks (square) have actually no sites in common. 



Special Choice which yields a Good Blockspin 

In the language of Sect. H, we know that we have made a good choice of blockspin if the effective 
action remains local after arbitrarily many or arbitrarily big blocking steps. The averaging 
kernel C{x,z) which is nonvanishing only if z and x have the same pseudoflavour, and which 
satisfies the Laplace equation (^) with Neumann boundary conditions on block boundaries, 
has proven to be good in that sense. This result extends to blocking from continuum action with 
2['^/^l flavours to staggered fermion field on the lattice. Details will be given in the next section. 
Fig. ^ visualizes the overlapping of blocks which do not share sites of the same pseudoflavour 



as the block centers. In numerical investigations |^2[ non-overlapping blocks were used as well. 
But then one does not retain the successful pure gauge limit. 
A successful block spin requires 

2['^/2] flavors. There is no consistent way of putting a single 
chiral particle on the lattice (this is the statement of the theorem by Nielsen and Ninomiya 0): 
Therefore there is no good blockspin either for a single chiral field, as we had already remarked 
in Sect. |1|. 



8 Efficient Computation of Gauge Covariant Propaga- 
tors 

A quenched simulation of lattice QCD requires the evaluation of products of Dirac propagators 
S{U). S depends on the actual gauge field configuration U. In a Monte Carlo simulation of 
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full QCD, one needs also the fermion determinant 



det-^^^SiU) =U-^ / V<p exp[-(0,5(f/)0)] 



Since for every new gauge field configuration f/, S{U) needs to be recomputed or updated, it is 
important to have an efficient method to do so. The conjugate gradient (CG) or the minimal 
residual algorithm is state of the art. Great hopes to do better are attached to multigrid (MG) 

methods pq, wi pm B3, m m,m,m. 



Deterministic Multigrid Methods 

For convenience, we will use the opposite convention in enumerating the lattices of a multigrid 
in this section. The fundamental (i.e. finest) lattice is denoted by A = A°. There will be a 
sequence of lattices A°, A^, A^, ... of increasing lattice spacing a^, viz. Oj+i = LbOj with = a. 
We wish to solve an inhomogeneous linear equation 

^oXo = /o (94) 

on the fundamental lattice A = A°, for given /q. In our case, = -^72 + m2. After some 
relaxation sweeps on hP one gets an approximation xo to Xo which differs from the exact solution 
by an error cq = Xo~Xo- The fundamental idea of the MG to the solution of partial differential 
(or difference) equations ||3l|, ^ is that the error cq should become smooth very fast, although 
it may become small very slowly because of critical slowing down (CSD). The error satisfies 
the equation 

Doeo = ro (95) 

which involves the residual ro = /o — -DoXo- If eo is smooth, it is determined to a very good 
accuracy by a function ei on the next coarser lattice A^, and can be represented in the form 

eo = Aei (96) 

with an interpolation map A which should be so chosen that it maps functions on A^ into 
smooth functions on A°. Conversely, ei can be obtained from eo with the help of an averaging 
map C which satisfies 

CA = t . (97) 
It follows that Ci = Ccq. Inserting (PB) into (PH) and acting on the result with C, we see that 



ei will satisfy the equation 

Did = ri (98) 

with 

Di = CDqA , ri = Cro . (99) 

The problem has been reduced to an equation on the coarser lattice. If there is still too much 
CSD at this level, one may repeat the procedure, going to coarser and coarser lattices. 

Given A, a possible choice of C which satisfies ( pTj) is C = {A*A)^'^ A*. The kernel A{z, x) 
should be a smooth function of z. The appropriate notion of smoothness depends on the 
dynamics, i. e. on Dq, in general. Smooth means little contributions from eigenf unctions to high 
eigenvalues of Dq. This insight is confirmed by results reported below. The point is important 
in systems in gauge fields and for other disordered systems. 
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Gauge Covariance 



In a gauge theory, kernels A and C should be chosen in a gauge covariant fashion; see the 
discussion in Sect. ^ Under gauge transformations on A° they transform according to 

A{z,x) ^ g{z)A{z,x)g{xy^ , 

C{x,z) ^ g{x)C{x,z)g{z)-^ . (100) 

This is consistent with CA = 11. 

The most general expression of a kernel A with this covariance property is a weighted sum 
of parallel transporters U (C) along paths C from x to z, e. g. 

A{z,x)= U{C) , (101) 

C: X h^z 

where g{C) are numbers. And analogously for C. We will prefer not to specify the weights g 
explicitly, but to determine A, C as solutions of covariant equations. 



Ground-State Projection Multigrid 

In the ground-state projection MG method the averaging operator C from a grid to the next 
coarser grid is a projector on the ground-state of a local Hamiltonian. The adjoint of C satisfies 
a gauge covariant eigenvalue equation (cf. (pSj), (pl|)), 

(-Ab.c.C*)(^,x) = eo{x)C*{z,x) . (102) 

Remember that the kernel C* {z, x) which solves ( |102| ) is an x matrix which is in general 
not an element of the gauge group. {N^. is the number of colours.) 

For bosons one may choose Neumann boundary conditions on block boundaries, — Ab.c. = 
—Ajy^x, as discussed in Sect. ||, (^). The lowest eigenvalue eo(x) of —Ab.c. is a gauge invariant 
quantity which is a measure of disorder. A normalization condition CC* = 11 may be imposed, 
and the covariance condition C*{x,x) > (as a matrix) makes the definition of C unique. 
Kernels defined by (|102|) and C*{x,x) > enjoy gauge covariance (|100|) . 



For staggered fermions, the Laplacian should be regarded as living on the sublattices which 
consist of sites with the same pseudoflavor. It connects only sites with the same pseudoflavor. 
Neumann boundary conditions on the Laplacian on the sublattice of sites z with the same 
pseudoflavor as x must be supplemented by the requirement that C (x, z) = unless z and x 
have the same pseudoflavor. This extra requirement could be regarded as a consequence of 
supplementary boundary conditions. 

There exists an efficient algorithm for computing C ||TB[. The programs are fully vectorized. 
In 4-dimensional SU{2) gauge fields, computation of C on the whole lattice costs CPU time of 
the order of one heatbath sweep for the gauge field. Thus, it is not too expensive to solve 



and it is not necessary to sacrifice gauge covariance by working with gauge field independent 
kernels in gauge-fixed [/-configurations. (Gauge fixing costs also CPU time.) 



Interpolation Kernel A 

Given the averaging kernel C, there exists an ideal choice of the interpolation kernel A. It is 
determined as follows. For every function ( "blockspin" ) $ on A^, = A^ minimizes the action 
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Figure 5: Optimal interpolation kernel A{z, y) for bosons on an 18^ lattice in a quenched SU{2) 
gauge field equilibrated at /3 = 2.7. {Lb = 3, periodic b. c.) Shown is a two-dimensional cut 
through the block center y = 0, and are fixed. The vertical axis gives the trace norm of 
A{z, y). (Remember that A{z, y) G IR ■ SU{2).) 

H = (0, -Do0) subject to the constraint Ccp = Recalling the definition of smoothness, this 
can be rephrased: = is the smoothest possible function with prescribed block average 
$ = C(j). It follows that A satisfies (|106|) below. With this choice of A, the action Ti completely 
decouples into a sum of actions for the different MG layers. Moreover, Di = {CC*)A*DoA] 
this is selfadjoint for our C-kernels which satisfy CC* = 1. 

A good "choice of blockspin" , i.e. of C, is characterized by the fact that the ideal kernel 
A{z,x) {z G A", X G A^) associated with it has good locality properties. This means that 
A{z,x) is big for 2 G x, and decays exponentially in \z — x\ with decay length 1 block lattice 
spacing ai (cf. Sect. ^). Numerical computations of the ideal kernel A were done in quenched 4- 
dimensional SU (2) gauge fields at various values of P = 4:/g^, including the case of a completely 
random gauge field (/? = 0). (For bosons Dq = —A+m?, while for fermions Dq = — 'p'^+m? with 

= or small) . It was verified that for any /5- value the definition of the averaging kernel C as 
a solution of a gauge covariant Laplace eigenvalue equation (|55D, ( pT]) (with Neumann boundary 
conditions on block boundaries) yields a good choice of blockspin in the sense described above. 
This is true both for bosons and for staggered fermions; see Figs. ^ ^ ^ decays exponentially 
over distance Oi (nearly) as fast in the presence of gauge fields as in their absence. 

Because of the exponential decay of the optimal kernel A., one may try to approximate it 
by a simpler choice which fulfills A{z^ x) = for z ^ a. certain neighborhood of block x. This 
neighborhood might be taken to be only the block x itself. The coarse grid operator would 
then be defined through Di = A*DqA. 
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It follows from the optimization criteria discussed below that A should have the property 
that the lowest eigenmode 0o of Dq should admit a representation of the form 0o = with 
a suitable $ to a very good accuracy. The proposal of to define A through the eigenvalue 
equation ( |102| ) with — Ab.c. taken to be a local Hamiltonian with Dirichlet instead of Neumann 
boundary conditions does not meet this requirement. 

For periodic boundary conditions the Galerkin choice (also called variational coarsening) 
^ = C* is better. In this case Di can be defined as Di = 5~^CDqC* The fluctuating 



length of its matrix elements can be adjusted to fluctuate around the "right" value (of the exact 
Di{x,y)) by tuning the real parameter 6. 

Numerical Results for Propagators Computed by MG 



Ground-state projection MG computations for propagators were done in 2-d U{1) ^ 
and (approximate ground-state projections) in 2-d and 4-d f/(l) and 2-d SU{2) gauge fields 
p4| , [36| , |32| . The first ground-state projection MG computation of gauge covariant propagators 
in 4:-d non-Abelian gauge fields (for gauge group SU{2)) was presented in Ref. Besides 
ground-state projection there is another approach for choosing the averaging kernel C. This 



approach is called "parallel transported MG". It is based on the representation ( |101| ) and is 
studied in Refs. . 



Bosonic Propagators in 4-d SU{2) Gauge Fields. 

For bosons, Dq = —A + m^. The MG method is of interest near criticality, i. e. for slowly 
decaying propagators. For nontrivial gauge fields we enforce slow decay by choosing negative 
and very close to the negative of the lowest eigenvalue —ml^ > of — A.^ The relaxation time 
T - defined by the asymptotic exponential decay of the norm of the residual or error - behaves 
like T oc (Am^)~^/^ for small Am^ = m? — m1^. 

Results in 4-dimensional SU{2) gauge fields equilibrated with Wilson's action at various values 
of /5 H: 



The MG algorithm with the ideal interpolation kernel A eliminates CSD for any value 
of the gauge coupling; see Table |^. These computations were the first without CSD in 
non-trivial gauge fields. They prove that ground-state projection is a good choice of C, 
and that the MG method can cope with the "frustration" (disorder) which is inherent 
in non-Abelian gauge fields. However, the optimal A{z,y) is not translational invariant 
(except for U = 1) and has support on all sites z of the fine lattice, for all sites y of the 
block lattice. Therefore the use of this kernel for production runs is impractical, but it 
was important to answer questions of principle. In particular this result confirms that 
there exists an appropriate notion of smoothness in disordered cases, as described above. 
The ideal A is the smoothest kernel which obeys CA = 1. 

The most practical MG algorithm with A = C* does not eliminate CSD in non-trivial 
gauge fields, z = 2 remains (as in 1-grid relaxation), although z = in pure gauges; see 
Fig. ^. However, MG is able to outperform the conjugate gradient algorithm; see Fig. 



^Relaxation times of conventional iterative algorithms depend only on and not on the lattice size |A|; 
this is contrarily to the Dirichlet problem, for instance. In case of periodic b. c. there is only an implicit (slight) 
dependence on |A| through the value of rri^j.. 
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Table 2: Results of the idealized MG algorithm with lexicographic SOR (relaxation parameter 
u) on 9^ lattices A°. Kernel A is the solution of the equation ([—A + m^^, + k C*C]A) {z, y) = 
K C*{z, y) for large k. This expression tends to the ideal ^-kernel described in the text for large 

K. 



p 


ml 


optimal uj 


r for Am^ < 10"^ 


oo 





1.27 


1.6 


2.7 


-0.8210607 


1.38 


1.9 


2.5 


-0.9477085 


1.40 


1.9 


2.2 


-1.2218471 


1.45 


1.9 


1.8 


-1.7567164 


1.57 


2.5 





-2.7480401 


1.69 


5.2 



Figure 6: Computation of bosonic propagators (—A + m^j. + Am^)"^. Relaxation times r (in 
comparable work units) of iterative algorithms on an 18*^ lattice in a quenched SU{2) gauge 
field at /3 = 2.7 with m^j. = —0.7554339. (MG with A = C*; Jac means Jacobi relaxation, 
while SOR stands for successive over-relaxation.) 



After the publication of Ref . , a practical nonlinear MG algorithm has been developed 
which eliminates the appearance of CSD when the mass is lowered, some volume depen- 
dence of the correlation times still remains, though. Details will be reported elsewhere 
Pf. 



Propagators for Staggered Fermions in 4-d SU{2) Gauge Fields. 

Of the two different proposals for the averaging kernel C made in [^, only the Laplacian 
choice has been implemented numerically yet. There, — Ab.c. = — Atv,^ in ( |102| ), where A is 
the fermionic 2-link Laplacian defined in (^3]). The optimal interpolation kernel A associated 
with this C has the desired falloff properties in non-trivial gauge fields; see Fig. ^. Thus, the 
Laplacian choice for C defines a good blockspin for staggered fermions, and therefore it seems 
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Figure 7: Computation of bosonic propagators (—A + m^j. + Am^) ^. Convergence on an 18^ 
lattice in quenched SU{2) gauge fields at (a) /3 = 10.0, and (b) P = 2.7, with m^j. = —0.1533739, 
resp. -0.7554339. (MG with A = C*.) 



not to be necessary to implement the more intricate proposals mentioned in However, 
numerical simulations with the optimal A have not been performed yet. 

MG with variational coarsening A = C* works, but up to now we had to restrict ourselves to 
relatively small lattices (up to 18^) where this MG algorithm is not competitive with conjugate 
gradient. The authors of Refs. |^0| report that their algorithm performs considerably better 
when they re-unitarize the blocked gauge field (matrix elements of the effective Dirac operator) 
such that it is again an element of the gauge group. However, our results indicate that the 



difference between using a unitary or a dielectric gauge field [T^\ on coarser layers can be 
compensated by adjusting the real parameter 6 in Di. By tuning 6, one can re-unitarize the 
blocked gauge field "on the average" . 

We are optimistic that the implementation of the nonlinear MG algorithm found for bosons 
(see above) also gives a practical and competitive MG method for fermions. This implementa- 
tion is currently investigated. For details refer to [OT. 



9 Neural optimization of deterministic multigrids 

The results of Sect. ^ confirm that the multigrid approach is able to handle inhomogeneous wave 
equations in disordered media, e.g. in the presence of gauge fields, but the appropriate notion of 
smoothness depends on the dynamics, i.e. on the operator Dq in the equation which one wishes 
to solve. The interpolation kernels A should be smooth. Smooth means little contributions 
from eigenfunctions to high eigenvalues of Dq. 

This means that the appropriate kernels A are dynamically determined and need to be 
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Figure 8: Optimal interpolation kernel A{z, y) - associated with the "Laplacian" choice for C - 
for staggered fermions on an 18^ lattice in a quenched SU (2) gauge field equihbrated at /3 = 2.7. 
(Lb = 3, periodic b. c.) A is the solution of the equation {\^— yi"^ + k C*C]A^ {z, y) = k C*{z, y) 
for large k. Shown is a two-dimensional cut through the block center y = 0, 2:3 and 2:4 are fixed. 
The vertical axis gives the trace norm of A{z,y). (Remember that A{z^y) G IR ■ SU{2).) Note 
the support properties of A, it has support on all even lattice sites and is vanishing on all odd 
sites. This is due to the non- vanishing field strength at finite (3. In the limiting case of pure 
gauges the field strength term a^^F^j^i, vanishes and the support of A is reduced to 1/16-th of 
lAI. Then we have 16 bosonic ^-kernels on 9^ sublattices. 
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found before or together with the solution of the equation. (The averaging kernel C is often 
related to A.) The question is how. There are interesting potential applications other than 
gauge theories |^. Therefore a general strategy is desired. 



Criteria for optimality 

One may write down a cost functional whose minimum yields the optimal kernels, i.e. the 
fastest possible convergence of the multigrid iteration. 

For definitenes and simplicity, think of a twogrid. Assume that updating consists of standard 
damped relaxation sweeps on the fundamental lattice to smoothen the errors, followed by an 
updating step Xo ^ Xo — Aei which involves the exact solution ei of (^) on the coarse grid 
A^, with some Di which is to be determined together with A and C. 

Convergence speed is estimated by the norm of the iteration matrix p. The iteration matrix 
is the product p = poPi of the iteration matrix Pq for the smoothing, and the iteration matrix 
Pi for the coarse grid updating. The norm 

IIpII < ll^opoll • • 

Only the second factor depends on A, C and Di. Inclusion of the operators D^^ in the factors 
reflects the philosophy that smoothing suppresses the high frequency modes of the error pi 



Choosing the trace norm, one is lead to determine A, C and Di from the minimality condition 
for Ei = ||Do"Vif , 

mm = El = f f \T{z,w)\^ (103) 



T{z,w) = v{z,w) — / / A{z,x) u{x,y) C{y,w) (104) 

V = Dq^ and u = D^^ (105) 

The minimality condition leads to conditions on kernels A, C and on Di. Suppose first that 
no constraints of practicality are imposed on the kernels. We assume that Do is selfadjoint. 
Optimizing A, for given C and u = -Df ^, one finds the condition TC* = 0. That is, 

DoA = C*u-^ (106) 

with u = uCC* . This reproduces the familiar condition for an ideal y4.-kernel. Variation of 
the cost functional with respect to C yields = 0. This is equivalent to a second condition 
which is obtained from (|106|) by interchange of the role of A and C* and substituting D* for Di. 
Variation with respect to Di yields no new condition. These conditions leave much freedom. 

But practicality requires that one imposes restrictions of the form A{z, x) = except when 
z is sufficiently close to x, and similarly for C and Di. (We will regard A^ as a sublattice of 
A*^.) The proposal is to find the optimal kernels which satisfy these constraints by minimizing 
El. 



For instance, one may admit link variables J7(z, w) in the covariant Laplacian A which take values or 1. 
Links (z, w) with J7(z, w) — Q are considered deleted from the lattice. If deleted links are distributed at random 
with suitable probability, on obtains a fractal lattice. Solutions of the equation (—A + Af^)xo = /o admit a 
random walk representation on this lattice. 

^^For instance, one may demand that C = {AA'Y^A* or, equivalently, A = C*{CC*)^^. Then CA ~ 1, 
the second condition follows from the first, and F agrees with the fluctuation field propagator ( ^0| ) with Di = 
-CC*Aeff if Do - -A. 
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Figure 9: Neural net from a twogrid (A°, A^) in one dimension. It has one input layer A'', one 
output layer A° and one layer A^ of hidden nodes. 

The idea was tested on a baby model, the 1-dimensional Poisson equation 

(-A + xo = fo (107) 

on a chain A'' of lattice spacing 1, with very small M. A^ consisted of the even sites in A*^. As 
a condition of practicality it was demanded that 

C{x, z) = = A{z,x) unless \z — x\ < 1 . 

Optimization via gradient descent yielded the expected result, in the limit of small M, 

C{x,x) = 1 , C{x,x±l) = ^ , Di = 2{-Ai + 4:M^) (108) 
Aiz,x) = C{x,z) {xeA\zeA^) , (109) 

modulo normalization conventions. Ai stands for the Laplacian on A^. 
Neural net approach 



It is interesting to rephrase the problem in the language of neural nets |^ . 

We regard the layers of the multigrid as layers of a perceptron. The sites of the lattices 
are called nodes in this context. In the twogrid case there will be an input layer with nodes 
z e A°, a hidden layer with nodes x E which are sites of the coarse grid, and an output 
layer with nodes w G A°. Nodes z of the input layer are connected to nodes x of the hidden 
layer by "neurons" with connection strength C{x, z), so that node x receives as input a linear 
superposition of the output of the input nodes z. Nodes x of the hidden layer are connected to 
nodes w of the output layer by "neurons" of connection strength A{w,x) (see Fig. y). 



The network receives as input an approximation ^{z) to the solution of the equation (94) 
which we wish to solve, and performs an iteration step in order to obtain as output an improved 
approximation 0{z). The desired output is the exact solution ((z) = D^^/q. If the iteration 



step involves the exact solution of the coarse grid equation (|98|) then the network is not strictly 
feed-forward. Q This is because the hidden layer needs to do computations more general than 
the usual computation of some nonlinear function at individual nodes in order to solve the 

^^A neural network is called "feed-forward" if the nodes in a layer receive their only input from nodes on 
preceding layers Q. 
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coarse grid equation. This could be remedied if one envisages iterative solution of the coarse 



grid equation (|98D . We return to this below. 

The network is supposed to learn to compute an output O which is as close as possible 
to the target C- The learning process consists in adjusting the connection strengths C{x,z) 
and A{z, x). One envisages that a training set is presented to the network to teach it. 

It involves a sequence of inputs The network has its output compared to the desired 
answer and receives feedback about the error which is then used to adjust the connection 
strengths according to some learning rule. Application of typical learning rules amounts to 
iterative minimization of some cost functional E. The standard cost functional is 

fl z 

In our case, the output is a linear function of the input and of /o which has the exact solution 
-Do Vo as a fixed point. Therefore 

o = pe + (i-p)^o"Vo , 

where p is the iteration matrix. The target C'^ = Dq^ fo is independent of the input. The cost 
functional becomes 

^ = E|K^-^o"Vo)f 

where = (/, /) = J2z Since the equation is linear, we may imagine without loss of 

generality that /o is arbitrarily small. Assuming the training input is a complete orthonormal 
set of functions on A° in the limit of small /o, the cost functional becomes 

E=\\pf^ Tripp*) ■ 

This is the same cost functional as considered before. 

We could restrict our attention to the coarse grid updating whose updating matrix is pi. In 
this case one wants mainly to suppress the low frequency modes of the error. This motivates 
us to propose a training set with ^'^ such that DqC,'^ are orthonormal. As a result one finds 
that El should be minimized, where Ei = WDq^piW^ as before. Minimization of Ei leads to 
less practical learning rules than E because of the presence of the operator D^^, though. 

Let us return to the question of the implementation within the framework of feed-forward 
networks. 

We assume that the input layer knows to compute vq^z) = DqC,{z) — fo{z) from the input ^. 
This is easily implemented within the standard feed-forward framework by adding an extra 
layer. Initial fine grid relaxation sweeps to smoothen tq are also easily implemented. The 
nodes x of the hidden layer will receive input ri{z) = J2zC{x, z)rQ{z) that is computed from 
the smoothened version of tq. 

Let us imagine that we do not use the exact solution ei to the coarse grid equation (pSD , 
but content ourselves with an approximation to —ei{x) which comes from a single Jacobi 
relaxation sweep. Then the node x is asked to compute the output 

E{x) = -di{x)~^ ri(x) , (110) 

from its input ri(x), which it can legally do. di is the diagonal part of Di, viz. 

Dif{x) = di{x)f{x) + contributions oc f{y) with y ^ x . 
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We imagine that there are additional connections (not shown in figure (|^) ) of strength 1 from 
input nodes to output nodes which represent the same lattice point z. The total input to the 
output node z is then 

=eW + E^(^>^) '^{x)^i{z)-Y.A{z,x) ei(x) , (111) 

X X 

and this is also the output from the output node z. 

Updating of connection strengths C{x^z^ and A{x^z^ proportional to the gradient of the 



cost functional E leads to a standard back-propagation learning rule as described in As is 
discussed there (Sect. 6.1) the same network - or rather a bidirectional version of it - can be 
used to compute the necessary adjustments. 

There is still one nonstandard feature. The network should also learn di{x)~^. This becomes 
a standard learning problem (to be solved by the back-propagation learning rule) if we think 
of duplicating the hidden layer, connecting the duplicates of site x by "neurons" of connection 
strength di{x)~^. 

We discussed a twogrid for simplicity. Generalization to a multigrid is obvious and will be 
presented elsewhere H^j. 



10 Blocked Gauge Fields 

Remember that the gauge covariant Laplace operator A for gauge field U has kernel (a = 1) 

A{Z,W} = < rrf . .f (112) 

\ U[z,w) n zn.n.w . 
in d dimensions. And similarly for the staggered Dirac operator^ 

vj^^ = i V^{z)U{z,w) ifw = z + le^,fi = -d,...,d,fij^O, 
1 otherwise . 

A blockspin definition for matter fields gives us the effective operators on coarser layers, 
Aeflf = A*AA and X^^g = A*'J(^A. Given Aefr or X^gfj, we have candidates for gauge fields U on 
the block lattice, 

U{x,y) = Aefr(x, y) for x,y nearest neighbours 
resp. U{x,y) = f ^^{x,y) / ri^{x) iiy = x + U\e^. (114) 

These expressions for U (x, y) have the correct gauge covariance property. 

We add some comments on the Dirac case. Since we use odd Lb, the staggered block lattice 
can be regarded as a sublattice of A in which the pseudofiavor of a site does not depend on 
whether we regard it as a site of the fundamental or the block lattice, and the same is true of 
its property of being even or odd. The kernel A will always have the property that A{z, x) = 
unless z and x are both even or both odd. Therefore X^e// connects even to odd sites. The factor 
Vfi{^)~^ can be regarded as defined by regarding a; as a point in a sublattice of the fundamental 
lattice. This agrees with the natural definition on the block lattice. Note also that f]^{z) is 
independent of the //-component z^) of z. 

^^We use the conventions 77-^ = —77^ and = — e^. z is called odd if the sum of its integer coordinates is 
odd. 
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The blocked gauge fields U (x, y) are actually "dielectric" gauge fields 



They may be 

brought back into the gauge group by multiplying with a positive real number (for U{1) and 
SU{2)) or with a positive matrix (for U{N), N > 2). 

A different choice of blockspin for gauge fields was introduced by Balaban and used in 
rigorous renormalization group work 



11 Simulation Methods for the Computation of Effective 
Actions 



The standard Monte Carlo simulation techniques for the computation of functional integrals 
can be used to compute effective actions, and in particular effective potentials. 
There are three methods to take into account the 5-function constraints. 

1. ) Gaussian blockspins. Use a Gaussian in place of the 5-function in the constraint. 
This definition of a block spin transformation is very convenient for numerical work. But it 
has some inherent dangers which have been discussed in the literature (cp. [Q): Fictitious 
marginal operators may appear and cloud the renormalization group fiow. Some numerical 
results are found in . 

2. ) Simulations at fixed blockspin $ [^. Do the updating so that $ is fixed. Compute 

{dn/d^{x))^ 

This determines Heff{^) up to a constant. 

For linear blockspin transformations, the quantity dT-C/d^(x) is defined as follows: 



on 



dip{z) 



A{z, x) 



(115) 



so that the action Ti changes by 60. 



on 

(9$(x) 



6^{x) when cp is changed by an amount Sip{z) 



A{z,x)6^{x). Simulations with fixed block spin are simulations for an auxiliary theory with 
an infrared cutoff a^^. Therefore the critical slowing down will be governed by the ratio ao/a 
which is usually appreciably less than the correlation length in units of a. 

This method is general, but it is costly when one wants to compute more than effective 
potentials. 

3.) Method of "fluctuating coupling constants" [^, |15| This method is applicable in 
cases such as Higgs models with and without gauge fields, where the effective action depends 
only on matter fields $ which are related by a linear blockspin definition to matter fields which 
appear polynomially in H. 



Given (f, U then the high frequency field ( is defined. A sequence of configurations {ipa, Ua) 
yields therefore a sequence (C^, Ua), a = 1, 2, ... , Nconf- Expanding 



(116) 



defines a sequence g"^ of "fiuctuating coupling constants". They are stored. At the end of the 
run, the effective potential can be recovered from them. For a 1-component field 



exp 



;ii7) 
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where Za is the 1-dimensional integral 



(118) 

One may choose to expand around another point than $ = instead. 

In principle, fluctuation coupling constants could also be used to encode the information 
about full effective actions. But in this case would be functions of m arguments xi, . . . , 
on the block lattice. Only few of them could possibly be stored. 



(i$exp 



Multigrid Updating of Matter Fields 

The multigrid method provides a tool to fight critical slowing down in Monte Carlo simulations. 
It uses or is motivated by constructs that we discussed. Therefore we would like to mention it 
at least briefly. 

Imagine that the blockspin attached to some block lattice A-^ of lattice spacing is 
changed at site x G A-^ by This will induce a change of the matter field 

should be thought of as minimizing the kinetic energy subject to some constraint (such as 
Cif = $), therefore it should be smooth in z (no jumps). If we do not specify the averaging 
kernel C, there are no further restrictions on A . 

Similar updatings are possible for fields subject to constraints such as unit vectors in a- 
models, spins in CP"- models etc. For instance 

ip{z) i-^ e'xp[A{z,x) e] (f{z) , (119) 

where the matrix e is an element of the Lie algebra of the symmetry of the model. 

In the "unigrid" approach of Hasenbusch, Mack and Meyer [Q, hypercubes x of different 



side lengths are considered and updatings of the form (|1 19|) are performed with smooth kernels 
A{z, x) which vanish for z outside x. The positions of the hypercubes x are chosen at random 
in the lattice. It turns out to be important for the effectiveness of the method that the selected 
hypercubes x of one length have some overlap. 

The unigrid method proved very successful for 2-dimensional asymptotically free models: 
The critical exponent z which governs critical slowing down is nearly zero in this case. Refer- 
ences are for the CP^-model and for the 0(3)— model. It is essential for the success 



that these models have no mass parameter The effective potential for 4-dimensional 

(y?"^— theory was computed using a similar updating scheme |T^, but with disjoint hypercubes 



X which form a lattice, and with kernels A which had support on hypercubes x and on their 
nearest neighbours. 
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Appendix: Constraint Effective Potential with Gauge Fields 
in Finite Volume 



The result of the 1-loop calculation of Sect. |^ reads as follows (for details see 



[/^(t.) = [/<?>(*) + _Tr 



in i ' 



6 



2 SAj-z^ 5A^zy 



A=0 



with 



(120) 

(121) 
(122) 



where 



4^)($,A) = ^(f/M$)/$)E'7#TiEl^-oP + 4l7E'(rn<^o(<^') + 3r„,,($)) 

n r n 

E(^Ln + ilAnol') - j^j:'{2T^^^i^)T^^^{<!?) + r„^,($)r„^,($) 

r " n,m 

+r„<^,($)r^<^o($))El^nm|' + f§<f2E'(4rn<^.(*) + Tn^ol*)) 



_i VI 4 |2 



(123) 



with 



(124) 



t/,K<f) = im2$2 + Aa$4^ fc2 =4E;.sin2(fn^,, 2 - -/^ ^ 2 

r„^„(<i.) = + u'j,m-\ r„,^.(<f ) = + f/M<i>)/<i>)-^ 

$ the absolute value of the covariant magnetization of (^) and Amo, ^Inn^ ^mm defined by 

(^r)oo = ^ E^O {^)^rz-fifj,Arz-i_ifj.4'o{^ ~ /^) 



(^r)?io 



Ar 



^EC (^)^rz-MM^r^-MM^o(2 " /^) 

Zfl 

^ E (^)^r^-/./. (^ " /i) 



(125) 



2/i 



4 4* 



Here, '?/'„(-2) = exp{^nz) and the eigenvalue eo is given by (ffOl). ?7^; and f/"; are the first and 
second derivative of Ud, and prime ' on the sum means that the n = term is omitted. 

The trace in ( |120[ ) involves summation over the discrete 0(4)-matrix indices and a sum over 
space-time points on a lattice A. 
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